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BsiHKAI 
I 

TGTGCTCTAAAGCCCCCATTCAAAATGCCATTTAACGCATTTGATACCTTC 
ACACGAGATTTCGGGGGTAAGTTTTACGGTAAATTGCGTAAACTATGGAAGTT^ C ^ ^ 

CALKPPFKMPFNAFDTFKEK 



Smal 



Xmal 
I 



BsmI 

I 



ATTTTGAAACCCGGGAAGGAAGGAGTGAAGAATGCCGTAGGAGATTCGCTGGGGATCTTA 

1 h f. f. . 

TAAAACTTTGGGCCCTTCCTTCCTC ACTTCTTACGGCATCCTCTAAGCGACCCCTAGAAT 

ILKPGKE GVKNAVGDSLGIL 
TaqI 



CAAAGAAAACTCGATGGGACCAACGAGGAGGGAGATGCCATTGAGCTGAGTGAGGAAGGA 

121 + ^ _^ 

GTTTCTTTTGAGCTACCCTGGTTGCTCCTCCCTCTACGGTAACTCGACTCACTCC^ 180 



QRKLDGTNEEGDA 



I E L S E E G 



AlwNI 
PvuII 

Stui Bsgl psti 



^GGCCTGTGCAGACATCCAGAGCCCGAGCCCCTGTGTGCGACTGCAGCTGCTGTGGCATC 
181 + 1 ^ _| 

TCCGGAC ACGTCTGTAGGTCTCGGGCTCGGGGACACACGCTGACGTCGACGACACCGTA^ ^ ^ ^ 

RPVQTSRARAPVCDCSCCGI 

BspHI 



CCCAAGCGGTACATCATCGCTGTCATGAGTGGCCTGGGATTCTGCATTTCCTTTGGGATT 
241 H ^ _| ^ 

GGGTTCGCCATGTAGTAGCGACAGTACTCACCGGACCCTAAGACGTAAAGGAAACCCTAA 

PKRYIIAVMSGLGFCISFGI 

MscI Xcml 
I I 

CGGTGCAACCTTGGAGTGGCCATTGTGGAAATGGTCAACAATAGCACTGTGTATGTGGAT 

301 + ^ . 

' 1 h 360 

GCCACGTTGGAACCTCACCGGTAACACCTTTACCAGTTGTTATCGTGACACATACACCTA 

RCNLGVAIVEMVNNSTVYVD 



Fig. 1 
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BsmBI 

BsaWI BaitiHI 

' I . 

GGGAAACCGGAAATCCAGACAGCACAGTTTAACTGGGATCCAGAGACGGTGGGAAGGGCG 
361 + + + ^ ^ ^ ^20 

CCCTTTGGCCTTTAGGTCTGTCGTGTCAAATTGACCCTAGGTCTCTGCCACCCTTCCCGC 



Bmrl EcoRI 
I I 



GKPEIQTAQFNWDPETVGRA 

Maelll 

Ncol Tsp45l 

I I 

AATTCTCTTATCCATGGATCTTTTTTCTGGGGTTATATTGTGACACAAATTCCCGGTGGC 
421 + + ^ ^ ^ ^ 

TTAAGAGAATAGGTACCTAGAAAAAAGACCCCAATATAACACTGTGTTTAAGGGCCACCG 

NSLIHGSFFWGYIVTQIPGG 

Aatll 
BsaHI 



TTCATTTCAAACAAGTTTGCTGCTAACAGGGTCTTTGGAGCTGCCATCTTCTTGACGTCA 
481 ^ ^ ^ , ^ ^ 

AAGTAAAGTTTGTTCAAACGACGATTGTCCCAGAAACCTCGACGGTAGAAGAACTGCAGT 

FISNKFAANRVFGAAIFLTS 
Xmni SacII 



Pcil 



BstUI 



ACCCTGAACATGTTCATCCCTTCCGCGGCCAGGGTGCATTACGGCTGTGTCATGTGTGTG 
541 + + ^ ^ ^ ^ 

TGGGACTTGTACAAGTAGGGAAGGCGCCGGTCCCACGTAATGCCGACACAGTACACACAC 



TLNMFIPSAARVHYGCV 

Maelll 
AlwNI Tsp45l 



M C V 



AGGATTTTGCAGGGTCTGGTGGAGGGTGTGACCTACCCAGCCTGCCACGGGATGTGGAGT 
601 + + ^ ^ ^ ^ 

TCCTAAAACGTCCCAGACCACCTCCCACACTGGATGGGTCGGACGGTGCCCTACACCTCA 

RILQGLVEGVTYPACHGMWS 

Bpml 

AAGTGGGCACCTCCCCTGGAGAGAAGTCGTCTAGCCACAACCTCTTTTTGTGGTTCCTAT 
661 + + + ^ ^ ^ 

TTCACCCGTGGAGGGGACCTCTCTTCAGCAGATCGGTGTTGGAGAAAAACACCAAGGATA 
KWAPPLERSRLATTSFCGSY 

F/g. 7 cont'd 
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BstAPI TatI 

I I 

GCCGGGGCAGTCGTTGCTATGCCCCTTGCAGGAGTATTGGTGCAGTACATTGGCTGGGCC 
721 + + + + + + 780 

CGGCCCCGTCAGCAACGATACGGGGAACGTCCTCATAACCACGTCATGTAACCGACCCGG 
AGAVVAMPLAGVLVOYIGWA 



Bsgl Pcil 
I I 

TCTGCCTTTTATATTTACGGGATGTTTGGAATTATTTGGTACATGTTTTGGCTGCTGCTG 
781 + + + + + + 840 

AGACGGAAAATATAAATGCCCTACAAACCTTAATAAACCATGTACAAAACCGACGACGAC 



M 



W 



M 



W 



PstI 



HphI 



I 

CAGGCTTATGAGTGTCCAGCAGTTCACCCAACAATATCCAATGAAGAACGGACCTACATA 
841 + + + + + + 900 

GTCCGAATACTCACAGGTCGTCAAGTGGGTTGTTATAGGTTACTTCTTGCCTGGATGTAT 



V H 



N 



E R 



Bbel 
Haell 
Hhal 
Sfol 
BsaHI 
HinPlI 
Narl 
KasI 



MscI 
Bgll 



Ncol 



I 

GAGACAAGTATAGGAGAAGGCGCCAACTTGGCCAGTCTGAGCAAATTCAACACACCATGG 
901 + + + + + + 960 

CTCTGTTCATATCCTCTTCCGCGGTTGAACCGGTCAGACTCGTTTAAGTTGTGTGGTACC 



N 



K 



N 



W 



BstXI 



AGAAGGTTTTTCACATCCTTGCCTGTCTATGCCATTATTGTGGCAAACTTTTGTAGAAGC 
961 + + + + + + 1020 

TCTTCCAAAAAGTGTAGGAACGGACAGATACGGTAATAACACCGTTTGAAAACATCTTCG 



R 



V 



V 



N 



R 
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TGGACCTTCTATTTGCTCTTAATAAGTCAGCCTGCTTACTTTGAAGAGGTCTTTGG^ 
1021 + + + + + + 1080 

ACCTGGAAGATAAACGAGAATTATTCAGTCGGACGAATGAAACTTCTCCAGAAACCCAAA 
WTFYLLLISQPAYFEEVFGF 

Bsal PvuII Dram BsaBI HphI Acc65I 

'I I I I I 

GCAATAAGTAAGGTGGGTCTCTTGTCAGCTGTCCCACACATGGTGATGACAATCGTGGTA 
1081 + + + + + + 1140 

CGTTATTCATTCCACCCAGAGAACAGTCGACAGGGTGTGTACCACTACTGTTAGCACCAT 

AISKVGLLSAVPHMVMTIVV 

Kpnl 
I 

CCCATTGGAGGACAACTGGCTGATTATTTAAGAAGCCGAAAGATTTTGACCACAACTGCT 
1141 + + + + + + 1200 

GGGTAACCTCCTGTTGACCGACTAATAAATTCTTCGGCTTTCTAAAACTGGTGTTGACGA 

PIGGQLADYLRSRKILTTTA 
BspHI 



GTCAGAAAGATCATGAATTGTGGAGGCTTTGGCATGGAGGCAACCTTGCTCCTGGTGGTT 
1201 + + + + + + 1260 

CAGTCTTTCTAGTACTTAACACCTCCGAAACCGTACCTCCGTTGGAACGAGGACCACCAA 

VRKIM NCGGFGMEATLLLVV 

BstXI 



GGGTTTTCCCATACCAAAGGAGTGGCTATCTCCTTCCTGGTGCTTGCTGTAGGATTTAGT 
1261 + + + + + + 1320 

CCCAAAAGGGTATGGTTTCCTCACCGATAGAGGAAGGACCACGAACGACATCCTAAATCA 

GFS HTKGVAISFLVLAVGFS 

GGCTTTGCAATTTCAGGTTTCAATGTCAACCACCTGGACATTGCTCCACGATATGCCAGC 
1321 + + + + + + 1380 

CCGAAACGTTAAAGTCCAAAGTTACAGTTGGTGGACCTGTAACGAGGTGCTATACGGTCG 

GFAISGFNVNHLDIAPRYAS 

Xcml 
I 

ATCCTCATGGGGATCTCAAATGGCGTGGGAACCCTCTCTGGAATGGTTTGTCCCCTCATT 
1381 + + + + + + 1440 

TAGGAGTACCCCTAGAGTTTACCGCACCCTTGGGAGAGACCTTACCAAACAGGGGAGTAA 
ILMGISNGVGTLSGMVCPLI 



Fig. 1 cont'd 
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1561 



Smal 

Xmal Xmnl 

I I 

GTTGGTGCAATGACAAAGCACAAGACCCGGGAAGAATGGCAGAATGTGTTCCTCATAGCA 

1441 + + + + + + 1500 

CAACCACGTTACTGTTTCGTGTTCTGGGCCCTTCTTACCGTCTTACACAAGGAGTATCGT 

VGAMTKHKTREEWQNVFLIA 

BsiHKAI Mlyl 
ApaLI Plel 

I I 

GCCCTGGTGCACTACAGTGGAGTCATCTTCTACGGGGTCTTTGCTTCTGGGGAAAAACAG 

1501 + + + + + + 1560 

CGGGACCACGTGATGTCACCTCAGTAGAAGATGCCCCAGAAACGAAGACCCCTTTTTGTC 

ALVHYSGVIFYGVFASGEKQ 

Bmrl BplI 

I I 

GACTGGGCTGATCCAGAGAATCTCTCTGAGGAGAAATGTGGAATCATTGACCAAGATGAA 

^ + + + + + 1620 



CTGACCCGACTAGGTCTCTTAGAGAGACTCCTCTTTACACCTTAGTAACTGGTTCTACTT 
a DWADPENLSEEKCGIIDQDE 

• ^ » 

r/ BssSI 



a 



TTAGCCGAGGAAACAGAACTCAACCACGAGGCTTTCGTAAGTCCCAGAAAGAAGATGTCT 

1621 + + + + + + 1680 

AATCGGCTCCTTTGTCTTGAGTTGGTGCTCCGAAAGCATTCAGGGTCTTTCTTCTACAGA 

LAEETELNHEAFVSPRKKMS 



Jf" Bbsl EcxI 

□ I I 

TATGGAGCCACCACCCAGAATTGTGAGGTCCAGAAGACGGATCGGAGACAACAGAGAGAA 

1681 + + + + + + 1740 

ATACCTCGGTGGTGGGTCTTAACACTCCAGGTCTTCTGCCTAGCCTCTGTTGTCTCTCTT 

a YGATTQNCEVQKTDRRQQRE 

TaqI 



TCCGCCTTCGAGGGGGAGGAGCCATTATCCTACCAGAATGAAGAGGACTTTTCAGAAACA 

1741 + + + + + + 1800 

AGGCGGAAGCTCCCCCTCCTCGGTAATAGGATGGTCTTACTTCTCCTGAAAAGTCTTTGT 

SAFEGEEPLSYQNEEDFSET 



Fig. 1 cont'd 
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BbvCI 
BpulOI 

I 

TCTTAACGTGCATCTTCCCCTCAGCTTACAACCAGAAGTCTCCACACCCATTGCTTTTCC 

1801 + + + + + + 1860 

AGAATTGCACGTAGAAGGGGAGTCGAATGTTGGTCTTCAGAGGTGTGGGTAACGAAAAGG 

S*RASSPQLTTRSLHTHCFS 

Pf IMI 
I 

CATACCTTGGCCTTCCAGGGGGCCAAATCACAGGAAAGGGGGAGACTAAATCAACAACAG 

1861 + + + + + + 1920 

GTATGGAACCGGAAGGTCCCCCGGTTTAGTGTCCTTTCCCCCTCTGATTTAGTTGTTGTC 

HTLAFQGAKSQERGRLNQQQ 

Bsal 



AGAAGAAAAATGCCTTCTTACAAAGATGGGCGTATGGATCTTGGTCTCAGTTAATTAGAT 

1921 + H h h + + 1980 

^ TCTTCTTTTTACGGAAGAATGTTTCTACCCGCATACCTAGAACCAGAGTCAATTAATCTA 

a. RRKMPSYKDGRMDLGLS*LD 



Bell Mfel 

I I 

AGTTGATCATATTTTTTTTGGGGGGGGCAATTGGGCATTGGCTGTTGAGCCTTCTCTCAA 

1981 + + + + + + 2040 

TCAACTAGTATAAAAAAAACCCCCCCCGTTAACCCGTAACCGACAACTCGGAAGAGAGTT 

S*SYFFWGGQLGIGC*AFSQ 

AAGAACAATTTATTCAGGAAGAAATGGCTAGAAGAATAAGGAGTGGCTTGTTGCTCAAAT 

2041 + + + + + + 2100 

TTCTTGTTAAATAAGTCCTTCTTTACCGATCTTCTTATTCCTCACCGAACAACGAGTTTA 

KNNLFRKKWLEE*GVACCSN 

Eco57I TatI Bpml 

I I I 

AAACACTGAAGAAATCCCTCTTTGGTCTGGAGAAGAGTACATGGTGGTTGCCACCCCATC 

2101 + + + + + + 2160 

TTTGTGACTTCTTTAGGGAGAAACCAGACCTCTTCTCATGTACCACCAACGGTGGGGTAG 

KH*RNPSLVWRRVHGGCHPI 



Fig. 1 cont'd 
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EC00109I 

EcoRV EcoNI PspOMI 

I I I 

TCCAAGGATATCCATGTAGAGGACAATCTCTGCAACCTAATGAAGGGAATCACTCATGGG 

2161 + + + + + + 2220 

AGGTTCCTATAGGTACATCTCCTGTTAGAGACGTTGGATTACTTCCCTTAGTGAGTACCC 

SKDIHVEDNLCNLMKGITHG 

Apal 
EC00109I I 

I I 

GGCCCTTGGTTGTGCCAGGTGCTTTATGAACATTCTTATTTAACTCCCACACCCTAATAT 

2221 + + + + + + 2280 

CCGGGAACCAACACGGTCCACGAAATACTTGTAAGAATAAATTGAGGGTGTGGGATTATA 

GPWLCQVLYEHSyLTPTP*Y 

AGTTATTGTACCCATTTTACAACTAAGAACATTAAATGACTAGGTTGGCCCACCCAAGGT 

2281 + + + + + + 2340 

TCAATAACATGGGTAAAATGTTGATTCTTGTAATTTACTGATCCAACCGGGTGGGTTCCA 

SYCTHFTTKNIK*LGWPTQG 

TGTCCTCTCAGAGCCAAAGCTGAGACTGGCAGATGACCAGGAGTTTTAGGAAGGAAGGAA 

2341 + + + + + + 2400 

ACAGGAGAGTCTCGGTTTCGACTCTGACCGTCTACTGGTCCTCAAAATCCTTCCTTCCTT 

CPLiRAKAETGR*PGVLGRKE 

GGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGGTTCAGTTGAGTGT 

2401 + + + + + + 2460 

CCTTCCTTCCTTCCTTCCTTCCTTCCTTCCTTCCTTCCTTCCTTCCCAAGTCAACTCACA 

GRKEGRKEGRKEGRKGSVEC 

AGGGTCATTTTCAATGACAAAAACAAAAACTGGAATCAGTTGGTTTGTGGGTAATTCCAT 

2461 + + + + + + 2520 

TCCCAGTAAAAGTTACTGTTTTTGTTTTTGACCTTAGTCAACCAAACACCCATTAAGGTA 

RVIFNDKNKNWNQLVCG*FH 

SphI 



GTTTGGTCAAGGGTGTGTGCATGCAAACGTGTATGTGCGTGTGTGTGTGTTTGTGTGTTT 

2521 + + + + + + 2580 

CAAACCAGTTCCCACACACGTACGTTTGCACATACACGCACACACACACAAACACACAAA 

VWSRVCACKRVCACVCVCVF 



Fig. 1 cont'd 
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GNGTGTNAGNNNGNATNANAANAAAAN 

2581 + + 2607 

CNCACANTCNnsnsrCNTANTNTTNTTTTTN 



Fig. 1 cont'd 
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Fig. 2 A 
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Fig. 5 A 



Fig. 5 B 



Fig. 5C 



Fig. 5 A, 5B, and 5C 
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Fig. 5D 
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